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ABSTRACT 


This  paper  discusses  the  necessity  for  scientific  and 
technical  personnel  to  maintain  personal  files  and  the 
limitations  of  centralized  information  services  in 
meeting  the  diverse  and  changing  information  access 
requirements  of  these  personnel.  A  prototype  service 
is  described  that  uses  a  computer  to  aid  individuals 
organize,  maintain  and  find  what  is  in  their  files  by 
means  of  personalized,  printed  indexes.  Through 
building  a  machine -readable  record  of  user  indexing 
practice,  the  service  furnishes  a  means  of  identifying 
current  user  information  requirements  and  for  improved 
feedback  of  these  requirements  to  centralized  services. 
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USER  REQUIREMENTS,  PERSONAL  INDEXES,  AND  COMPUTER  SUPPORT 


Over  the  past  twenty  years  an  Increasing  amount  of  attention  has  been  given 
to  the  problems  of  managing  information.  Nowhere  is  the  concern  more  evident 
or  more  real  than  in  organizations  concerned  with  scientific  research  and 
development.  These  organizations  have  led  in  the  development  and  application 
of  modem  data  processing  technology  in  the  dissemination,  indexing,  storage, 
and  retrieval  of  documented,  information.  So  far,  however,  these  applications 
have  been  confined  largely  to  centralized  information  services  that  are, 
necessarily,  limited  in  their  capacity  to  respond  to  the  immensely  variegated, 
changeable,  and  time -dependent  information  requirements  of  technical  personnel. 
That  these  limitations  are  real  and  troublesome  is  reflected  in  the  increas¬ 
ing  number  and  results  of  user  studies  and  in  the  (so  far)  rather  equivocal 
attempts  to  measure  and  evaluate  the  effectiveness  of  these  systems  and 
services.  Then  there  is  the  growing  proliferation  of  small  specialized 
services  and  centers  that,  from  the  viewpoint  of  managerial  control  and 
planning,  tend  to  sprout  up  like  weeds  in  a  well-tended  garden.  An 
additional  kind  of  evidence  is  the  prevalence,  persistence,  and  size  of 
personal  and  office  files. 

Jahoda,  et  al.,  in  summarizing  previous  work,  state,  "Information  gathering 
habit  studies  have  shown  that  a  significant  portion  of  researchers  maintain 
personal  indexes.  Studies  by  Fishenden,  Tomudd,  and  Hogg  and  Smith,  for 
example,  have  brought  out  the  fact  that  45$. , .57$. » .and  66$,  respectively  of 
surveyed  scientists  had  and/or  used  personal  indexes.  Zwemer  has  found  that 
nearly  every  scientist  surveyed  in  a  recent  study  kept  a  personal  file  in 
the  way  of  reprints,  abstracts,  or  notes  on  cards,  and  that  the  average  rate 
of  growth  of  26  such  collections  is  330  times  per  year...  In 
another  recent  study  of  the  information  needs  of  Department  of  Defense 
scientists  and  engineers,  17$  of  the  interviewed  scientists  and  engineers 
used  personal  files  as  their  first  source  of  information,  while  51 $••• 
relied  on  their  local  environment--personal  files,  departmental  files,  and 
colleagues--as  a  first  source  of  information."  (l) 

One  may  conclude  from  this  and  other  evidence  that  most  of  the  scientific 
and  technical  personnel  working  in  government,  industrial,  and  academic 
environments  need  to  maintain  personal  or  office  files.  A  smaller  but  still 
appreciable  number  maintain  indexes  to  their  files  and  devote  a  sizable 
amount  of  time  and  effort  to  indexing  them. 
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The  Problem 


The  existence  of  large  working  files  has  often  been  a  source  of  concern  to 
managers  because  (a)  a»lnJ  of  Individual  files  seems  costly,  (b) 

working  files  have  a  tendency  toward  growth,  if  unrestricted,  and  (c)  the 
files  reflect  individual  vocabulary  and  filing  practices  and  cannot  readily 
be  shared  by  other  individuals.  Most  of  the  efforts  that  have  so  far  been 
directed  toward  aiding  technical  personnel  in  finding  and  obtaining  the 
information  they  need  has  concentrated  either  upon  improvements  in  centralized 
services  to  supplant  the  need  for  personal  collections  and  indexes,  or  upon 
providing  centralized  indexing  support  to  reduce  the  investment  of  highly 
paid  labor  in  such  efforts. 

It  seems  probable,  however,  that  no  matter  how  centralized  information 
services  are  improved  and  expanded  in  accessibility,  scope,  and  usefulness, 
there  will  continue  to  be  a  need  for  building  and  maintaining  personal 
collections  that  reflect  unique  individual  requirements  and  habits  of  work. 
Centralized  indexing  tends  to  impose  a  uniform  practice  and  vocabulary  that 
cannot  be  wholly  responsive  to  individual  perspectives  and  manner  of  construing 
subject  matter  within  the  context  of  a  specialized  set  of  task3  and  goals. 

There  are  also  entire  classes  of  information  and  documentation  whose 
interrelations  cannot  readily  be  provided  for  through  centralized  services. 


Consider  the  following  sequence  of  documentation  categories: 


working  notes 
laboratory  notebooks 
preliminary  sketches  and  drawings 
memoranda 

formal  drawings  or  diagrams 
data 

progress  reports 

test  reports _ 


summary  technical  reports 
papers  and  articles 


These  are  the  primary  concern  of 
indexing  and  abstracting  services, 
libraries  and  information  centers 


technical  orders  and  specifications 
operational  manuals 
maintenance  manuals 
trade  catalogs 


Here  we  have  a  sequence  in  time  typical  of  engineering  development  projects 
in  which,  as  we  progress  from  notebooks  to  memoranda  to  formal  sunnary 
reports,  detail  is  lost.  Sunnary  technical  reports,  papers  and  articles, 
particularly,  tend  to  be  the  least  informative  in  detailed  technical  content. 
In  the  course  of  a  project  a  working  engineer  will  need  to  correlate  in¬ 
formation  from  several  of  these  categories.  This  leads  to  a  necessarily 
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idiosyncratic  organization  of  the  necessary  paper  responsive  to  his  changing 
needs  in  the  course  of  a  task  or  project.  Comparable  requirements  for  the 
working  scientist  exist.  He  too  needs  information  organized  in  a  way  that  is 
responsive  to  the  changes  in  cognitive  set  that  ensue  as  an  inquiry  proceeds. 

It  is  improbable  that  any  centralized  information  service  could  serve  these 
kinds  of  transient  requirements  effectively,  if  only  because  they  cannot  be 
identified  with  sufficient  precision  and  in  good  time.  For  many  years 
various  techniques  have  been  explored  to  find  more  effective  ways  of  deter¬ 
mining  and  satisfying  the  document  and  information  requirements  of  scientific 
and  technical  personnel.  Most  user  studies,  however,  hav*»  depended  almost 
entirely  upon  the  traditional  devices  of  diary,  questionnaire  and  interview 
plus  statistical  records  of  centralized  services.  This  places  a  heavy 
reliance  on  retrospective  testimony.  Relatively  little  has  been  done  to 
study  the  actual  behavior  of  these  personnel.  In  most  organizations  it  has 
been  difficult  to  Implement  effective  feedback  mechanisms  to  centralized 
services  on  a  continuing  basis  or  to  arrive  at  reliable  means  of  measuring 
the  responsiveness  of  services. 


One  Approach  to  a  Solution 

Over  the  past  three  years  a  project  at  System  Development  Corporation  (SDC) 
has  been  concerned  with  developing  better  means  of  studying  user  behavior, 
identifying  needs  and  requirements,  and  helping  individuals  improve  the 
organization,  maintenance  and  access  to  their  personal  files.  Our  approach 
has  been  to  explore  the  use  of  a  computer  in  providing  printed  Indexes  to 
individual  collections.  A  service  was  developed  that  was  intended  to  meet 
a  diversity  of  requirements  and  be  adaptable  to  individual  viewpoints, 
vocabulary  and  habits  of  work.  An  early  version  of  the  service  has  previously 
been  reported.  (2) 

The  current  service  is  called  SURF  (Support  of  User  Records  and  Files).  SURF 
is  implemented  through  the  corporation's  MADAM  programming  language  and 
system  for  the  IBM  1^01.  (3)  Users  of  the  service  index  their  files,  fill 
out  and  submit  input  coding  sheets  to  the  service,  and  regularly  receive 
updated,  consolidated  indexes.  So  far  in  the  development  of  this  prototype 
service,  the  major  effort  has  been  spent  in  building  a  tool  that  would  be 
responsive  to  real  needs  and  would  minimize  the  effort  required  by  users  in 
coding  inputs.  This  latter  consideration  is  particularly  important  to  the 
viability  of  any  such  scheme.  The  service  must  provide  for  a  net  saving  in 
time  and  effort  to  the  individual  if  he  is  to  use  it. 

Figures  1  and  2  illustrate,  respectively,  an  input  coding  sheet  and  the 
resultant  printed  Indexes  for  a  bibliographic  example.  User  effort  at  the 
input  end  has  been  minimized  through  the  use  of  variable-length  fields 
labeled  by  field  numbers  that  identify  elements  within  each  indexed 
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entry.  Each  field  number  is  an  Arabic  numeral  followed  by  a  parenthesis. 

In  Figure  1,  field  number  l)  has  been  assigned  to  authors  and  agencies, 
field  number  3)  to  subject  keys,  and  field  number  2)  to  elements  not  to  be 
sorted  and  alphabetized.  The  field  numbers  are  automatically  deleted  from 
the  printed  indexes,  as  illustrated  in  Figure  2.  This  kind  of  free-field 
input  format  frees  the  user  from  having  to  concern  himself  with  how  his 
inputs  are  keypunched  and  processed.  There  is  no  rigid  formatting  require¬ 
ment  except  for  identifying  the  uccr  and  the  indexed  entries. 

Ideally,  such  a  service  should  help  individuals  find  more  effective  means 
of  organizing  and  accessing  files.  Most  SURF  users  have  found  that  they 
learned  to  build  better  indexes  through  having  to  U3e  the  products  of  their 
earlier  indexing  decisions.  Often  their  initial  choices  of  indexing  terms, 
order  of  entry,  etc.,  were  far  from  ideal.  Through  their  index**  tv>*»y 
could  diagnose  poor  practices  and  improve  their  grasp  of  what  is  required 
in  nomenclature  and  perspective.  At  the  same  timq  each  user  is  provided 
with  a  product  that  completely  reflects  his  outlook  and  manner  of  represen¬ 
ting  information.  The  adaptability  of  SURF  to  these  ends  is  indicated  by 
the  broad  variety  of  uses  to  which  the  service  has  been  put.  These  uses 
have  included  indexes  to  technical  literature  and  correspondence, 
engineering  data,  trade  catalog  citations,  35nm  slides,  and  a  dictionary  of 
grammar  rules  for  programmed  query  analysis.  (4)  (5) 

A  byproduct  of  the  service  has  been  the  cumulation  of  a  machine-readable 
file  of  user  indexing  practice  reflecting  user's  needs,  perspectives, 
vocabulary  and  manner  of  organizing  information  vital  to  their  work.  Such 
a  file  is  being  used  for  direct  observation  of  user  behavior  and  needs,  and 
for  feedback  to  centralized  services  for  document  description,  acquisition, 
dissemination  and  retrieval.  Analysis  of  this  kind  of  file  has  the  potential 
of  aiding  greatly  in  a  more  precise  identification  and  specification  of  user 
information  requirements.  Such  analysis  offers  an  additional  dimension  of 
study  to  the  information  derived  from  the  traditional  tools  of  diary, 
questionnaire  and  interview. 


Applicability  of  SURF 

The  approach  of  SDC's  SURF  project  has  been  that  of  developing  successive 
designs  and  programs  for  a  small  group  of  customers  representing  a  wide 
range  of  different  needs,  and  interacting  with  them  over  time  to  build  a 
truly  responsive  and  individualized  service.  In  a  sense  this  effort  could 
be  called  an  operational  experiment.  As  such  it  is  tied  to  the  particular 
concerns,  personnel  and  work  of  one  company.  It  is  probable  that  many 
organizations  would  find  it  advantageous  to  explore  more  effective  means  of 
serving  the  idiosyncratic  information  needs  of  their  scientific  and  technical 
personnel  in  a  comparable  way.  This  is  not  to  say  that  well -managed 
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Individual  files  will  eliminate  the  need  for  effective  centralized  informa¬ 
tion  and  document  services.  Working  files  and  centralized  services  are 
complementary  types  of  operations.  For  any  organization  there  i3  a  desirable 
balance  and  a  desirable  interface  between  these  enterprises.  An  operational 
exploration  comparable  to  the  SURF  development  offers  one  means  of 
identifying  ways  to  improve  services  while  aiding  the  ability  of  individual 
workers  to  serve  themselves. 


Everett  Wallace 


SURF  Input  Cooing  Sheet 
_ Room  9933 _  Ext.  6^fel 


n»te:  6/29/6 6 


ID  i 
Code  1 

1-3 

i  Output 
'  Forma  t . 

1  4-5  , 

Hi 

Card 

No. 

10 

Nusbered  Field*  ♦  Field  Contents 

11-79 

WEI 

|B1 

1  A  1 

1  i.)  Jahooa,  0.  1/  Hutchins,  R.  D.  1)  Oalford  R.  R.  .  )  character- 

0079 


0096 


letlcs  and  use  at  3)  personal  Indexes  Maintained  by  scientists  and 
engineers  2)  In  one  university.  2)  American  Dod«entatlon  vol  17 
no  l  p71-75  April  1966  1)  Florida  State  U  Faculty  2)  library 

l)  Croasley,  w.  0.  2)  Ifce  aadaa  systa*  2)  SDC  IK-21 90/002/00  2) 
fcc  19b^  3)  Madasi  orognuBdng  langtnge  and  system  3)  lbM  1U01 

orograoa  ?)  EW 

1)  Wallace,  E.  M.  3)  ran*  order  patterns  at  caason  words  as 
3)  dlscrlsdnators  of  subject  content  >)  In  scientific  and 
technical  proae  1)  Sysipoalua  on  Statistical  Association  Methods 
tor  3)  Mechanlted  Cx>eur— -tatlcn  2)  proceedings  p?2*>-229  NBS 
ml  sc  put  2o0  Dec  l^t1-  3)  language  processing  2)  Doyle 


Figure  1.  Example  of  SURF  Input 
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SURF  INDEX 


FIELD  NO. 


FOR  ME  I 


ENTRY  NO. 


CROSSiEY,  W.  0.  79 

CROSSLEY,  W.  0.  THE  MADAM  SYSTEM  SDC  TN-2198/002/00  DEC  1965 

MADAM  PROGRAMMING  LANGUAGE  AND  SYSTEM  I  BN  1401  PROGRAMS  EM 
FLORIDA  STATE  U  FACULTY  101 

JAHODA,  G.  HUTCHINS.  R.  D.  GALFORD  R.  R.  CHAR ACT ER I  ST  I CS  AND 

USE  OF  PERSONAL  INDEXES  MAINTAINED  BY  SCIENTISTS  AND  ENGINEERS 
IN  ONE  UNIVERSITY.  AMERICAN  DOCUMENTATION  VOL  17  NO  2  P71-75  A 
PRIL  1966  FLORIDA  STATE  U  FACULTY  LIBRARY 
GALFORD  K.  R.  101 

JAHODA,  G.  HUTCHINS,  R.  0.  GALFOr.D  R.  R.  CHARACTERISTICS  AND 

USE  OF  PERSONAL  INDEXES  MAINTAINED  BY  SCIENTISTS  AND  ENGINEERS 
IN  ONE  UNIVERSITY.  AMERICAN  DOCUMENTATION  VOL  17  NO  2  P71-75  A 
PRIL  1966  FLORIDA  STATE  U  FACULTY  LIBRARY 
HUTCHINS.  R.  0.  101 

J  AnflOA ,  G.  HUTCHINS.  R.  0.  GALFORD  R.  R.  CHARACTERISTICS  AND 

USE  OF  PERSONAL  INDEXES  MAINTAINED  BY  SCIENTISTS  AND  ENGINEERS 
IN  ONE  UNIVERSITY.  AFRICAN  DOCUMENTATION  VOL  17  NO  2  P71-75  A 
PRIL  1966  Fi OR  I  DA  STATE  U  FACULTY  LIBRARY 


SURF  INDEX 


FIELD  NO. 


FOR  mEI 


ENTRY  NO. 


DISCRIMINATORS  OF  SUBJECT  CONTENT  96 

MALLACE.  E.  M.  RANK  ORDER  PATTERNS  OF  COMMON  MORDS  AS  DISCRIMI 
NATORS  OF  SUBJFCT  CONTENT  IN  SCIENTIFIC  AND  TECHNICAL  PROSE  SY 
MPOSIUM  ON  STATISTICAL  ASSOCIATION  METHODS  FOR  MECHANIZED  DOCUME 
NTATION  PROCEEDINGS  P225-229  NBS  M1SC  PUB  269  DEC  1965  LANGUAG 
E  PROCESSING  OOYlS 

IBM  1401  PROGRAMS  79 

CROSSLEY,  M .  0.  THE  MAOAM  SYSTEM  SDC  TM-21 9B /DO? /OO  DEL  1965 

MADAM  PROGRAMMING  LANGUAGE  AND  SYSTEM  IBM  1401  PROGRAMS  EM 
LANGUAGE  PROCESSING  96 

MALLACE,  E.  M.  RANK  ORDER  PATTERNS  OF  COMMON  M'JAOS  AS  r>  I  SCR  I  1 1 
NATORS  OF  SUBJECT  CONTENT  IN  SCIENTIFIC  AND  TECHNICAL  PROSE  SY 
MPOSIUM  UN  STATISTICAL  ASSOCIATION  METHODS  FDR  ME  CHAN  I  7  E  0  DOCUME 

NTATION  PROCEEDINGS  P225-229  NBS  RISC  PUB  269  DEC  1965  LANGUAG 
E  PROCESSING  DOYLE 

MAOAM  PROGRAMMING  LANGUAGE  AND  SYSTEM  T9 

CROSSIEY,  M.  0.  THE  MAOAM  SYSTEM  SOC  TM- 2 l «8/002 /OO  OEC  1965 

MAOAM  PROGRAMMING  LANGOAGE  ANO  SYSTEM  ISM  1401  PROGRAMS  EM 
MECMANI 7E0  DOCUMENTATION  96 

WALLACE,  f.  M.  RANK  ORDER  PATTERNS  Of  COMMON  MOROS  AS  DISCRIMI 
NATORS  OF  SUBJECT  CONTENT  IN  SCIENTIFIC  ANO  TECHNICAL  PROSE  SY 
MPOSIUM  ON  STATISTICAL  ASSOCIATION  METHOOS  FOR  MECHANI ZED  DOCUME 

NTATION  PROCEEDINGS  P225-229  NBS  MISC  PUB  269  OEC  19*5  LANGUAG 
E  PROCESSING  DOYLE 

PERSONAL  INDEXES  MAINTAINED  BY  SCIENTISTS  AND  ENG  101 

JAHODA,  G.  HUTCHINS,  K.  D.  GALFORD  R.  R.  CHAR ACTE R I  ST  I CS  ANO 

USE  OF  PERSONAL  INDEXES  MAINTAINED  BY  SCIENTISTS  ANO  ENGINEERS 
IN  ONE  UNIVERSITY.  AMERICAN  DOCUMENTATION  VOL  I  7  NO  2  P71-75  A 
PRIL  1966  FLORIOA  STATE  U  FACULTY  LIBRARY 
RANK  ORDER  PATTERNS  OF  COMMON  MORDS  AS  9* 

MALLACE,  E.  M.  RANK  ORDER  PATTERNS  OF  COMMON  MORDS  AS  DISCRIMI 
NATORS  OF  SUBJECT  CONTENT  IN  SCIENTIFIC  ANO  TECHNICAL  PROSE  SY 
MPOSIUM  ON  STATISTICAL  ASSOCIATION  METHOOS  FOR  MECHANIZED  DOCUME 
NTATION  PMUCEEOINGS  P225-229  NBS  NISC  PUB  2*9  OEC  19*5  LANGUAG 
t  PROCESSING  DOYLE 


Figure  2.  Example  of  SURF  Index 
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